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ABSTRACT 


This  paper  prassnts  a  statistical  software  package 
developad  for  ase  on  the  Apple  II  Plus  microcoapiiter, 
modified  with  the  AppLa  Pascal  language  card.   The  program 
addresses  the  following:   determination  of  confidence 
intervals  for  single  and  bivariate  populations;   hypothesis 
testing  for  one  and  two  parameters;   computation  of 
cumulative  distribution  values  for  the  Normal,  Student's  T, 
Chi-square,  F,  Binomial  and  Poisson  distributions; 
computation  of  quantila  values  for  the  Normal,  Student's  T, 
Chi-square  and  F  distributions.   The  program  also  has  the 
capability  to  store,  retrieve,  and  modify  data  for  use  with 
the  statistical  procedures.   The  program  was  written  in  JCSD 
Pascal,  which  because  of  its  portability  indicates  that 
little  or  no  modification  would  be  required  to  use  it  with 
other  computers  which  are  UCSD  Pascal  compatible.   In 
addition,  because  of  Pascal's  blocic  structure,  the  program 
can  be  easily  modified  or  enhanced  to  include  other 
statistical  procedures  which  are  of  interest  to  the  user. 
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I.   INTRODUCTION 

For  the  operations  analyst  or  for  that  matter  anyone  who 
utilizes  the  methodology  and  problam  solving  approach  of  the 
operations  analyst,  soae  form  of  compatational  device  is  a 
necessity.   Cost,  rigid  interface  requirements,  and  a  number 
of  other  factors  have  in  the  past  frustrated  and  stifled  the 
analyst  in  bringing  to  bear  the  necessary  computational 
power  on  his  problem.   In  the  late  1970 •s,  however,  Texas 
Instruments  introduced  a  significant  amount  of  computing 
power  packaged  in  the  programmable  TI-59  calculator. 
Revolutionary  is  perhaps  a  bit  too  strong  as  a  description 
of  the  impact  that  this  and  like  devices  have  made  in  the 
operations  research  community,  but  certainly  most  analysts 
would  agree  that  the  amount  of  computing  power  that  can  now 
be  held  in  one*  s  hand  and  used  to  solve  problems  is  indeed 
remarkable. 

In  spite  of  this  impact,  however,  the  hand-held 
calculator's  contribution  might  even  now  be  waning  and 
yielding  to  a  more  spectacular  capability  found  in  the 
microcomputer.   The  growth  in  the  capability  of  these 
devices  since  the  beginning  of  the  industry  in  1971  has  been 
phenomenal.   Mastrakas  details  this  growth  and  gives  a 
glimpse  of  the  possible  direction  that  the  1980 's  will  see 
in  this  industry  [Ref .  1J  .   One  factor  accounting  for  the 
growth  in  microcomputer  technology  and  one  that  serves  to 
insure  its  future  is  the  intense  competition  that  pervades 
the  industry.   When  one  thinks  of  hand-held  calculators,  he 
thinks  of  Texas  Instruments  or  perhaps  Hewlett-Packard.  When 
one  thinks  of  microcomputers,  he  may  think  of  Apple,  Exidy, 
North  Star,  PET,  TRS-30,  or  a  number  of  other  devices  with 
like  capability. 


A  typical  microcomputer  package  costing  between  S2,000 
and  $2,500  miglit  consist  of  the  following:   the  computer 
(about  the  siza  of  a  saall  portable  typewriter)  with  64  K 
bytes  of  usable  random  access  memory,  two  floppy-disk  drives 
(5  1/4  inch  diameter)  for  prograa  storage,  and  a  black  and 
white  monitor  for  outpit  display,   A  variety  of  prograoaing 
languages  are  also  available  including  the  University  of 
California,  San  Diego  (JCSD)  Pascal,  BASIC,  FORTRAN,  PILOT, 
etc.   Some  of  the  microcomputers  also  have  a  graphics 
capability  allowing  the  user  to  visualize  mathematical 
forms,  plot  graphs,  and  plot  data  observations. 

The  American  populace  is  being  conditioned  through 
current  periodicals  and  news  features  to  expect  an 
increasingly  important  role  for  the  microcomputer  in 
everyday  life  —  from  grocery  shopping  to  environmental 
control  for  the  home.   The  December  1,  1980  issue  of  "u.  S. 
News  and  Horld  Report   predicts  that  eighty  percent  of  the 
Jnited  States*  households  will  have  a  microcomputer  by  1990. 
In  view  of  the  prospective  proliferation  of  thsse  devices 
and  also  the  significaat  computing  power  that  aow  exists, 
the  operations  research  analyst  can  ill  afford  not  to  begin 
to  exploit  the  capabilities  of  microcomputers.   Indeed,  the 
hardware  capabilities  have  grown  and  are  growing  so  rapidly 
that  today  good  compatible  softwara  to  support  these 
impressive  capabilities  is  seriously  lacking  in  specialized 
fields  such  as  operations  research.   It  is  this  software 
deficiency  that  this  thesis  addresses. 


ir.   BACKGROaND 

The  operations  analyst  can,  on  occasion,  be  required  to 
establish  confidence  intervals  or  test  hypotheses  about  an 
unknown  parameter  from  a  known  or  assumed  population.   The 
computer  program  written  as  a  part  of  this  thesis  allows  the 
analyst  to  quickly  and  easily  accomplish  these  tasks  when 
observations  are  from  Hormal,  Expoaential  or  Bernoulli 
populations. 

A.  DESCBIPTIOS  OF  THE  MICROCOMPOTER  SYSTEM 

The  software  develDpment  was  done  on  an  Apple  II  Plus 
microcomputer  with  two  floppy-disk  drives  (5  1/U  inch 
diameter)  as  add-on  peripherals.   The  system  is  equipped 
with  a  language  systea  giving  the  capability  to  use  the 
University  of  California,  San  Diego  (UCSD)  Pascal 
programming  language  as  well  as  the  BASIC  language  which 
comes  resident  with  the  computer.   The  standard  Apple 
computer  has  a  forty  column  output  display  which  makes  it 
compatible  for  use  with  a  standard  television  set.   This 
capability  can  be  enhanced  to  an  eighty  column  display  with 
an  additional  peripheral  device,  provided  a  monitor  is  used 
for  display  in  lieu  of  a  television.   The  outpat  format  for 
the  program  is  written  for  an  eighty  column  display  device; 
however,  using  the  special  built-in  features  of  the  Apple, 
the  format  can  easily  be  made  to  display  split  screen  on  a 
system  not  equipped  with  an  expanded  display  peripheral. 

B.  SOFTWARE  DEVELOPMENT 

Ling  and  Muller  give  several  considerations  which  should 
be  observed  in  the  development  of  software  for  statistical 
analysis  [Ref.  2,  Ref.  3 1  .   Among  these  considerations  are 
the  following: 


1 .   Choice  of  Programming  Lanaiage 

The  Pascal  programming  language  used  in  this 
software  development  effort  has  a  Qumber  of  features  which 
make  it  particularly  attractive  for  use  with  microcomputers. 
First,  Pascal  is  a  very  concise  language.   The  compiler  is 
small  and  compact  and  fits  easily  within  the  available  off- 
line storage  space  of  the  floppy-disk.   Second,  Pascal  is  a 
high-level,  geieral-pirpose  language  Ref,   '4|  .   The 
language  was  originally  introduced  in  1971,  which  is  receni: 
in  comparison  to  most  high-level  languages.   It  was  intended 
to  be  used  in  teaching  new  programmers  good  techniques  and 
style.   The  Pascal  language  fully  exploits  the  fundamental 
concepts  of  structured  programming,  which  is  a  technique 
used  by  many  professional  programmers  to  write  large  complex 
computer  programs   Ref.   4  ,   The  use  of  these  techniques 
facilitates  developing  programs  in  a  modular  fashion  (i.  e. , 
break  the  overall  pactage  into  logical  sub-packages  and 
proceed  to  program,  debug,  and  validate  each  sub-package 
individually) .   The  final  step  in  the  process  is  to  combine 
the  sub-packages  to  form  the  overall  package,   Using  the 
attendant  statistical  package  as  an  example,  six  sub- 
packages  or  modules  CDiprise  the  overall  package.   Each 
module  is  independent  Df  the  others  and  can  stand  alone  when 
compiled  with  the  main  program.   Third,  the  Pascal  language 
performs  arithmetic  computations  significantly  faster  than 
the  BASIC  language,   Pascal  is  often  implemented  as  a 
'pseudo   interpreted  Language  meaning  that  the  text  versions 
of  programs  are  first  compiled  into  a  code  file.   It  is 
during  the  compilation  phase  that  syntax  errors  are  detected 
by  the  compiler  and  brought  to  the  programmer's  attention. 
This  code  file  is  interpreted  and  executed  during  the 
execution  phase  of  the  program.   Host  processors  can  and 
typically  do  interpret  a  Pascal  code  file  significantly 


faster  than  a  corresponiing  BASI3  program  whicti  perforins  the 
same  computations   Ref.   U  . 

2.   Computational  Efficiency 

In  spite  of  thsir  impressive  capabilities, 
microcomputers  are  decidedly  inferior  to  larger  computers  in 
the  two  key  areas  of  computational  speed  and  accuracy. 
Pascal,  as  implemented  on  the  Apple,  vill  only  perform 
computations  using  six  decimal  places  of  accuracy   Ref.  5 
and  displays  only  five  places  past  the  decimal.  Accordingly, 
algorithms  which  are  used  and  work  very  well  on  larger 
computers  might  have  no  chance  of  producing  ths  same  results 
on  microcomputers  simply  because  the  execution  time  is 
excessive,  or  they  rejuire  double  precision  arithmetic. 

C.  PORTABILITY 

A  portable  program  is  one  that  can  be  run  on  a  number  of 
different  computer  systems  [Ref.   2!  .   Programs  written  in 
the  Pascal  language  are  valuable  in  this  regard  in  that  they 
may  be  run  on  a  variety  of  microcomputers  without 
alteration.   Some  of  the  machines  for  which  this  is  true  are 
computers  with  the  following  microprocessors:   8080,  8085, 
Z80,  6502(Apple),  6800,  and  9900.   Portability  is  a  very 
strong  asset  of  the  Pascal  language  even  though  it  is 
accomplished  at  the  expense  of  reduced  computational  speed. 
The  compiled  code  version  of  programs  is  called  'p-code'. 
Each  machine  has  a  special  interpreter  program  which  takes 
the   p-code  and  converts  it  to  a  form  compatible  with  the 
existing  host  aicroprocessor . 

D.  EASILY  RErRIEVABLE  INFORMATION  FILES 

Ling   suggests   the   use   of    help   files    to  allow    the   user    to 
make   efficient    use    of   a    program    [Ref.    2I  .      The    number   of 
information   files   which    are   available   and  their    content   is 
clearly   a   matter   of    judgment    based   on   assumptions   concerning 
the   knowledge   of    the   user    populatiDn   about   tha    procedures 
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used  ia  the  pCDgram.   Anticipating  that  most  of  the  users  of 
the  accompanyiag  statistical  package  will  be  familiar  with 
the  basic  concepts  of  the  procedures  themselves,  only  one 
help  file  which  pertains  to  data  entry  requirements  is 
included. 

E.   SiaPLS  USER    INTERFACE 

The  user  is  called  upon  to  malce  numerical  entries 
throughout  the  program.   Because  Pascal  is  a  strongly  typed 
language,  a  variable  of  type  integer  cannot  be  assigned 
floating  point  values.   Indeed,  this  requirement  is  so 
strict  that  if  the  program  expects  the  user  to  enter  an 
integer  and  he  accidently  enters  a  number  in  decimal  point 
notation,  the  program  will  abort  and  cause  the  entire  system 
to  re-initialize.   situations  such  as  this  are,  of  course, 
undesirable.   aoreover,  all  programs  written  with  the  intent 
of  establishing  a  dialogue  between  the  program  and  the  user 
should  be  as  trouble-free  as  possible  for  the  user  and 
minimize  as  much  as  possible  the  user's  chances  of 
committing  a  fatal  error  when  responding  to  program  prompts 
or  entering  data.   In  general,  schemes  to  accomplish  this 
are  costly  in  terms  of  computational  efficiency  and 
programming  steps;   however,  user  convenience  is  almost 
always  worth  the  costs.   The  problem  with  data  entry  or 
numerical  entry  from  the  keyboard  is  addressed  in  this 
statistical  package  by  making  all  numerical  entries  using 
a  string  and  converting  the  string  to  the  numerical  value 
it  represents,  k   string  is  a  variable  type  which  is  a 
linear  array  of  characters.   For  example,  given  a  string 
variable  called  'S',  which  is  assigned  a  value  of  *235», 
the  character  in  array  position  '  S  [l  I  •  is  •2»,  •s[2]'  is 
•3»,  and  'sfs]*  is  *5*.      Since  the  characters  'O*  through 
•9'  have  corresponding  numerical  American  Standard  Code 
for  Information  Interchange  (ASCII)  values  of  forty-eight 
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through  fifty-seven,  conversion  is  accomplishai  by 
subtracting  forty-eight  from  the  character's  ASCII  value. 
A  Pascal  procedure  which  converts  strings  to  numbers  is 
shown  in  Appendix  B, 

The  advantages  in  using  a  schaie  such  as  this  for  all 
numerical  entries  are  tvro-fold.   First,  the  ussr  can  take 
advantage  of  the  direct  cursor  addressing  available  on  aany 
microcomputer  video  displays  to  correct  a  data  entry  prior 
to  entering  it  into  computer  memory.   This  is  aot  possible 
when  the  program  expects  real  numbers  as  input.   Second,  the 
program  segment  which  converts  the  string  may  return  either 
an  integer  or  a  real  number,  whichever  is  required  by  the 
program.   This  avoids  having  the  user  concerned  with  the 
typing  requirements  demanded  by  the  Pascal  language. 
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III.   ALGORITHMS 

Algorithms  used  far  statistical  computations  on 
microcomputers  should  ba  selected  with  the  goal  of  providing 
the  best  accuracy  achi37able  at  the  minimum  computation 
time.   Given  the  limitation  of  only  six  decimal  places  of 
accuracy  on  the  Apple  microcomputer,  many  algorithms 
requiring  more  precision  in  their  computations  must  be 
rejected.   The  algorithms  remaining  as  candidates  must  be 
carefully  screened  to  insure  that  their  required 
computations  are  reasonable  from  a  time  standpoint  and  that 
they  exercise  fully  the  accuracy  capability  of  the  machine. 

A.   CALCOLATIOM  OF  VARIANCE 

To  illustrate  the  complications  stemming  from  reduced 
accuracy,  consider  the  following  example  given  by  Forsythe 
[Ref .  el  . 

Find  the  variance  of  the  following  set  of  numbers: 
48499,  48503,  48500,  *8498,  48500 
The  definition  for  the  variance  is 


i  ^   (X;   -   X)^ 


N  -  1  i  -  ^       ' 

where    S*    =   variance 

imj 
ip] 

This  formula  can  be  expanded  to  the  following  form: 


N   =  number  of  observations 
X  =  sample  mean. 


M  -  1   I  =  1     ' 
Use  of  this  formula  in  the  Apple,  however,  would  yield  a 
variance  of  zero,  simply  because  the  required  accuracy  is 
not  available  for  computations.   The  answer  of  zero  is,  of 
course,  incorrect.   Forsythe  in  his  article  on  statistical 
computing  offers  the  following  alternative  algorithm  to 
compute  the  variance. 


13 


saM  :=  0.0; 

S2  :=  0.0; 

READ  (N)  ; 

FOR  I  :=  1  TO  N  DO 

BBGIN 

READ  (X)  ; 

DEVIATION  :=  X  -  SdM; 

son  :=  SUM  >  DEVIAriON/I; 

S2  :=  S2  +  DEVIATION  *  (X  -  SJM)  ; 
END; 
S2  :=  S2/(N  -  1)  ; 

This  algorithm  prodaces  the  correct  answer,   variance  = 
3.5  .   It  also  illustrates  how  many  of  the  limitations  of 
the  microcomputer  can  be  overcome  through  careful  selection 
of  algorithms. 

B.   DISTRIBDTI3NS  AND  INVERSES 

The  algorithms  used  to  compute  probability  distributions 
and  inverses  and  the  source  of  each  are  listed  in  Appendix 
A.   When  compared  to  the  tabular  values  listed  in  Dixon  and 
Massey  [Ref .   7j  ,  the  algorithms  are  accurate  to  at  least 
three  decimal  places  in  probability  with  the  exception  of 
the  F  distribution.   The  F  distribution  is  accurate  to  three 
decimal  places  in  probability  in  almost  all  cases;  however, 
some  values  may  differ  from  the  listed  tabular  values  by  as 
much  as  .002  in  probability.   Although  the  F  guantiles  may 
differ  slightly  from  the  listed  tabular  values,  the 
probabilities  corresponding  to  the  values  given  by  the 
algorithm  are  accurate  to  three  decimal  places.   All  of  the 
algorithms  produce  results  within  three  seconds  except  the  T 
distribution.   when  computing  for  very  large  degrees  of 
freedom  for  the  T  distribution,  computation  tine  is  a 
function  of  the  degrees  of  freedom.   Typically  1000  degrees 
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of  freedom  takes  approximately  eight  seconds. 

Often,  increased  accuracy  can  only  be  gained  through 
iterative  techniques.   This  is  the  case  with  aany  of  the 
probability  distributions  and  their  inverses,  the 
T  distribution  being  an  example.   In  the  algorithms  shown  in 
Appendix  A  much  of  the  excessive  computation  time  has  been 
alleviated  by  combining  two  or  more  algorithms.   For 
example,  in  the  Chi-square  distribution,  the  F  distribution 
and  the  inverse  F  distribution,  one  algorithm  is  used  for 
small  degrees  of  freedom  and  another  for  large  degrees  of 
freedom.   Good  algorithms  for  these  distributions  exist  for 
large  degrees  of  freedom  which  are  not  based  on  iterative 
techniques  and  hence  are  computationally  fast.   However,  for 
small  degrees  of  freedom,  their  accuracy  falls  off  rapidly. 
Conversely,  the  algorithms  using  iterative  techniques  are 
very  accurate  at  all  ranges,  but  slow  for  the  larger  degrees 
of  freedom  since  the  number  of  iterations  required  is 
proportional  to  degrees  of  freedom.   The  break  between  small 
and  large  degrees  of  freedom  is  purely  subjective,  based  on 
choosing  the  best  combination  of  speed  and  accuracy. 
Selection  of  the  algorithms  themselves  was  likewise  based  on 
the  best  combination  of  speed  and  accuracy. 

C.  POISSON  AND  BINOMIAL  DISTRIBUTIONS 

Large  mainframes  have  the  capability  of  conputing  the 
Poisson  and  Binomial  Distributions  directly  from  their 
definitions. 

POISSON: 

If   the  random   variable   X    is   distributed    Poisson    with 

parameter   [Xj 


then    P   X   <=    k      = 


e-^ 


j=o  j! 
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BINOMIAL: 

If   the  randoi    variable    X    is   distcibutad   Binomial   with 


01    \ra.r 
parameters    ^n,p|, 

then   pfx   <=   kl     =    C 
L  J         j  =  o 


/n 


P'  0-9) 


n-3 


Both  of  thase  exprassions  contain  factorials  and 
summations;   consequently  for  larga  values  of  'k'  in  the 
Poisson  and  'n'  and  *<•  in  the  Binomial,  execution  time 
might  be  excessive,  or  the  intermediate  valuas  in 
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computation  might  exceed  MO   *  (tie  maximum  number  capable 
of  being  represented  oa  the  Apple).   A  better  solution  for 
the  calculation  of  thasa  probabilites  is  to  usa  the 
Chi-square  idaatity  for  the  Poisson  and  the  F  Identity  for 
the  Binomial  |Ref .   8,  Ref .   9j  . 

POISSON: 

Given   X  ^^  Poisson 

then  Pr  [x  <=  k]  =  1  -  x''(^^^^2.)    ^^A) 

where  2k  +  2  are  the  degrees  of  freedom 
of  the  Chi-square  variite. 


-^1 


BINOMIAL: 

Given      X   ^'^  Binomial      |.^»Pj 

then  P  [x  <=  k]  =  P  [^^zn-n.l^.t}    <=  ^  •  n. 

where  2n-2k  equals  the  degrees  of  freedom  for  the  numerator 
and  2k-»'2  equals  the  degrees  of  freedom  for  the  denominator. 

When  k  =  n,  the  probaoility  is  1.0. 

D.   STATISTICAL  ALGORITHMS 

Derivation  of  the  algorithms  used  in  findiag  confidence 
intervals  and  hypothesis  testing  is  shown  in  3torer 
[Ref.  1o]  . 
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17.   DESCRIPTION  OF  PACKAGE 

The  options  available  to  the  user  are  shown  in  the  block 
diagran  (Figure  1)  .   ifhen  the  program  is  execated,  the  aser 
begins  in  the  "outer  level".   To  proceed  he  must  select  from 
one  of  the  seven  options  which  are  shown  on  a  "menu"  (Figure 
2). 

A.   THE  DATA  ENTRY  MODOLE 

Selecting  option  *S*    from  the  outer  level  menu  (Figure 
2)  will  cause  the  data  entry  module  menu  (Figure  3)  to 
appear  on  the  screen. 

1.   General 

The  data   entry    requirements   are   intended   to   be  as 
trouble-free   as    possible    for   the   user.    The  user    is    prompted 
for   data    input   by   the   following   line: 

Record   N   — > 

• N«  is  an  integer  sequentially  updated  by  the 
program  when  the  'return*  key  is  pressed.   Following  the 
arrow,  the  user  inputs  as  many  data  values  as  he  wishes 
with  entries  separated  by  one  or  more  spaces.   The  only 
restriction  is  that  he  should  not  exceed  the  length  of  the 
display  line.  The  nomenclature,  record,  indicates  only  a 
logical  or  convenient  grouping  of  data  from  the  user's 
point  of  view. 

Prior  to  entering  any  data,  the  user  is  asked 
whether  or  not  the  observations  he  is  entering  are  paired. 
Since  two  of  the  statistical  procedures  are  predicated  on 
paired  data,  answering  'yes'  to  this  question  will  cause 
summary  statistics  to  be  computed  on    •X,Y*  pairs.   These 
summary  statistics  are  only  good  for  use  with  the 
procedures  requiring  paired  observations.   All  data  entered 
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1)  Hypothesis  Tasting  One  Parameter 

2)  Hypothesis  Testing  Two  Parameters 

3)  Confidence  Intervals  Single  Population 

4)  Confidence  Intervals  Bivariate  Populations 

5)  Distributions  and  Inverses 

6)  Data  Entry 
Q)  uit 

Figure  2.   The  Outer  Level  Menu 


Instructions 

Create  a  new  data  file 

Correct/Add  to  existing  datafile 

Enter  data  without  storing 

Review  existing  datafile- 

Review  summary  statistics  of  existing  file 

uit  and  return  to  outer  level 


Figure  3.   The  Data  Entry  Module  Menu 
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must  be  in  *1,I*    pairs  with  the  • X*  observation  listed 
first.   All  other  statistical  procedures  and  their  attendant 
data  entry  requirements  assume  that  all  of  the  observations 
come  from  a  single  population.   These  general  cequirements 
are  available  to  the  user  when  ha  selects  option  M'  from 
the  data  entry  menu  (Pigure  3)  . 
2.   Create  A  New  Data  File 

Selecting  option  '2*  (Figure  3)  will  prompt  the  user 
to  specify  a  file  name  for  the  data  observations.  After 
responding  with  a  file  name,  the  operating  system 
establishes  a  directory  entry  at  the  beginning  of  the 
largest  unused  blocJc  of  space  containing  at  least  fifteen 
blocJcs  on  the  specified  disk.   Because  the  filing  system  in 
the  University  of  California,  San  Diego  (OCSD)  Pascal 
language  is  random  access,  each  data  file  entered  will  have 
allocated  fifteen  blocks  of  space  to  insure  that  there  is 
enough  room  to  extend  the  file  if  the  user  desires  to  do  so 
at  some  later  time. 

The  floppy-disk  (5  1/4  inch  diameter)  used  by  the 
Apple  System  provides  a  storage  space  of  280  blocks.   This 
results  in  a  capability  to  store  seventeen  data  files  on 
each  disk.   Each  data  file  can  contain  a  maximum  of  ninety 
records.   Since  each  record  can  contain  as  many  observations 
as  the  user  desires  provided  that  it  does  not  exceed  the 
length  of  the  display  line,  a  reasonable  planning  figure  is 
eight  data  observations  per  record.   This  results  in  an 
upper  limit  of  approximately  720  data  entries  per  data 
file. 

Data  entry  for  a  new  file  begins  with  *  Record  1*. 
The  user's  only  concerns  when  entering  data  should  be  to 
separata  each  observation  by  one  or  more  spaces,  to  not 
exceed  the  length  of  the  display  line,  and  to  enter  'X,!* 
pairs  if  he  has  previously  indicated  paired  observations. 
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Si-Qce  ttie  data  is  entered  as  a  string  variable  and  then 
coQverted,  it  is  possible  to  correct  any  entry  prior  to 
goiag  to  the  next  record  by  simply  using  the  •back  arrow' 
kay  to  aove  to  the  place  at  which  the  correction  is  to  be 
mais.   The  Legal  symbols  which  may  be  used  in  data  entry  are 
the  digits  *  0*    through  •9*,  the  decimal  point,  the  comma, 
th2  plus  (>)  and  minus  (-)  signs,  and  the  •  E*  for  scientific 
notation.   Commas  are  ignored  by  the  procedure  which 
converts  the  strings  and  are  included  only  as  a  convenience 
for  the  user. 

Pressing  the  •retirn*  key  at  the  end  of  a  record 
taciinates  that  record  and  prompts  the  usee  to  input  the 
next  record  in  sequence.   If  the  user  inadvertantly  enters 
an  illegal  character  while  entering  data,  the  program  will 
aivise  hia  of  this,  indicate  what  the  character  was,  and 
proapt  hia  to  reenter  the  record. 

Once  the  user  has  entered  all  of  his  data,  he  must 
prass  the  'escape'  key  and  the  'return*  key  following  the 
last  data  entry.   This  will  close  the  data  file.   When  the 
fils  is  closed,  the  following  summary  statistics  are 
computed  on  the  observations  in  the  file: 

a.  The  Sample  Clean 

b.  The  Sample  Standard  Deviation  (N-1) 

c.  The  Sum  of  the  Observations 

d.  The  Sum  of  Squares  of  the  Observations 

e.  The  Number  of  Observations 

For  paired  data,  the  same* statistics  are  computed; 
however,  they  are  computed  on  the  differences  of  the  *l,I* 
pairs.   Hence,  the  number  of  observations  for  a  data  file  of 
paired  observations  is  exactly  half  of  the  total  number  of 
data  entries. 
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The  summary  statistics  for  each  data  file  are  kept 
in  a  separate  file  that  is  initially  established  on  the  same 
disk  as  newly  created  data  files.   The  file  of  summary 
statistics  reqiires  one  block  of  disk  storage  space.   It  is 
differentiated  from  the  original  file  of  observations  by  an 
'S*    concatenated  to  the  original  file  name. 
3.   Correct/Add  to  Existing  Data  File 

Selecting  optiDn  '3*     (Figure  3)  while  in  the  data 
entry  module  will  prompt  the  user  to  specify  a  file  name. 
Once  the  data  file  is  retrieved,  the  monitor  will  show  the 
name  of  the  file,  the  number  of  records  in  the  file,  and  the 
number  of  observations.  Immediately  after  this  information 
is  the  following  prompt: 

Enter  Record  dumber  — > 

Selecting  any  record  number  between  M»  and  the 
total  number  of  records  in  the  file  will  cause  the  retrieval 
of  that  record  and  will  display  as  follows: 

Old  Record: 

Hecord  2  — >   1  2  3  !*  5 

You  may  replace  the  complete  record  or 
press  <HTN>  for  no  changes. 

Record  2  — > 

Pressing  'return*  leaves  the  existing  record 
unchanged  and  prompts  the  user  to  anter  another  record 
number.   If  corrections  are  to  be  made,  the  user  must  enter 
the  complete  new  record  opposite  the  lower  prompt  and  press 
•return*.   The  updated  record  will  display  as  follows: 

Record  2  — >  12  3  3  5 

Press  <aTN>  if  OK,  <E5C>  if  not. 

Pressing  'return*  completes  the  update  and  prompts 
for  a  new  record.   Pressing  'escape'  produces  the  following 
display: 
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Enter    corrected   data  and   <RTN> 
fis^Drd   N   — > 


The   user   then   may    retype  the   line.      This  sequence 
aay   be   repeated   until   the   record   appears    as    the  user   wants. 
Entire   recocds   are   erased   or   deleted  by  typing   a   space  to 
produce   a  blank   line.      If   the  user   desires    only   to   update   an 
existing   data   file   and  not    extend  it,    he   types   the    number, 
•-!',    ia   response   to   tha    prompt    for   record    number,    whica 
closes   the   updated   file. 

If   the   user  selects   a   record   number    that   is   greater 
than    the   number  of   records   in  the  file,   he    reenters   the   data 
entry    phase   beginning   with    the   record   number   immediately 
following  the  last  existing    record.      For    axample,    if  there 
ace   thirty-seven   existing    records  in   tha    file,    selecting    any 
number   greater   than  thirty-seven   will   produce  the   following 
display: 

Enter    new   records. 

Entar    <ESC>    as   the   last   entry   and  <RTN>  to    terminate   input. 

Se::ord   38  — > 

From  this  point,  the  user  proceeds  exactly  as  if  he 
were  in  the  data  entry  phase  and  terminates  by  pressing  the 
•escape*  key  immediately  following  the  last  entry. 
4.   Enter  Data  Without  Storing 

This  option  allows  the  user  to  compute  the  mean, 
standard  deviation,  sum  of  observations,  sum  of  squares  of 
observations,  and  number  of  observations.   The  format  for 
entering  data  is  exactly  the  same  as  previously  discussed 
foe  creating  a  new  file.   As  indicated,  data  is  not  stored 
under  this  option;   hence,  once  a  record  is  terminated,  the 
entries  for  that  record  cannot  be  recovered  nor  changed. 
This  option  provides  an  expedient  way  to  determine  the 
sunaary  statistics  of  a  group  of  observations. 
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5.  Review  an  Existing  Data  File 

Selecting  this  option  allows  the  user  to  quickly 
review  any  or  all  of  an  existing  data  file  in  blocks  of  ten 
records  at  a  time.   The  user  is  prompted  to  eater  the  file 
name  of  an  existing  file.   The  program  retrieves  that  file 
and  displays  the  first  ten  records.   Pressing  'return*  at 
this  point  causes  the  next  ten  records  to  be  displayed, 
etc.   Pressing  the  'escape'  key  at  the  end  of  any  display, 
returns  the  user  to  the  menu  for  the  data  entry  module 
(Figure  3) . 

6.  Review  Summary  statistics  of  an  Sxistina  File 

All  files  that  are  stored  on  disk  have  an  associated 
summary  statistics  file  that  is  created  by  the  program  when 
a  newly  created  or  updated  data  file  is  closed.   This  file 
contains  the  following  information; 

a.  Mean 

b.  Standard  Deviation 

c.  Sum  of  Observations 

d.  Sum  of  Squares 

e.  Number  of  Observations 

The  summary  statistics  file  only  is  called  when 
specifying  a  data  file  to  be  used  in  the  other  modules 
containing  the  statistical  procedures.   Because  of  this,  it 
is  not  necessary  to  have  the  original  data  file  on-line  when 
performing  the  statistical  procedures;  only  the  summary 
statistics  file  is  required.   When  using  the  filing  system 
resident  in  the  Apple  Pascal  language  system  to  obtain 
directory  listings  of  various  disks,  the  summary  statistics 
file  is  distinguished  by  a  concatenated  'S'  on  the  end  of 
the  original  file  name.   For  example,  if  the  original  file 
name  was  STATrDATAI,  then  the  corresponding  summary 
statistics  file  is  named  STAT:DArAlS.   Since  the  length  of 
all  summary  statistics  files  is  one  block,  it  is  possible  to 
access  a  maximum  of  27'4  files  of  summary  statistics  on  any 
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0Q8  disk  of  storage. 
7.  Q)    dit 

Pressing  the  • Q*  key  will  return  the  user  to  the 
outer  level  (Figure  2). 

B.   DISTRIBUTIONS  AND  QOANTILES  MODDLE 

Selection  of  any  of  the  options  from  the  distributions 
and  guantiles  module  mean  (Figure  4)  will  produce  further 
prompts  which  require  the  user  to  enter  the  necessary 
information  concerning  values  of  the  random  variable, 
degrees  of  freedom,  and  probabilities  as  appropriate. 
After  each  computation  the  following  prompt  appears: 

C)  OQtinue  or  Q)  uit 

Pressing  the  •€•  key  will  allow  the  programmer  to 
continue  calculation  in  the  previously  selected  distribution 
or  guantile.   Pressing  the  *Q*    key  will  return  the  user  to 
the  menu  for  this  module  (Figure  4) . 

The  Pascal  language  system  allows  the  user  to  develop 
his  own  specialized  libraries  of  often  used  subroutines  for 
general-purpose  or  special-purpose  computations.   It  is  in 
such  a  library  that  the  algorithms  for  the  distributions  and 
guantiles  are  kept.   Osing  a  special  library  has  two  major 
advantages.   First,  when  the  user  is  developing  the  main 
program,  he  is  not  penalized  by  extra  compilation  time  for 
an/  of  the  routines  in  the  library.   The  code  in  the  library 
is  linked  by  a  separate  process  jRef.  ul .   Second,  the 
Library  can  be  .used  by  other  programs  which  require  the  use 
of  the  algorithms  contained  therein.   The  algorithms  for  the 
distributions  and  guantiles  fit  logically  in  a  library  since 
it  is  likely  that  other  statistical  packages  will  requira 
thsLr  use.   The  Apple  reference  manual  explains  the 
procedure  used  to  establish  new  libraries   Ref.  11j* 


25 


normal  Distribution 
T  Distribution 
Chi-Sguare  Distribution 
P  Distribution 
Binomial  Distribution 
Poisson  Distribution 

Normal  Quantilas 
T  Quantiles 
Chi-Sguare  Quantiles 
F  Quantiles 


Q)  uit  and  return  to  outer  Isval 


Figure  4.   The  Distributions  and  Quantiles  Module  Menu 


Population 

^)  Normal 

2)  Normal 

3)  Normal 
U|  Normal 

5)  Bernoulli 

6)  Exponential 

Q)  uit    and   return 


POPULATION    ASSOMPTIONS 


Assumptions 

u?,sigma-sqr    known 

u    &   sigma-sqr   ? 

u    5  sigma-sqr    ? 

u    known,    sigma-sqr   ? 


to   outer    level 


Parameter 

u 
u 

sigma-sq 
sigma-sq 

MTTF 


u  ==>   mean 

sigma-sq  ==>   variance 


Figure  5.      Confidence  Intervals   Single    Population   Menu 
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C.   CONFIDENCE  INTERVALS  AND  HYPOTHESIS  TESTIN3  MODDLES 

Selecting  aither  the  confidence  intervals  or  hypothesis 
testing  options  will  caase  the  mena  for  that  modale  to  be 
displayed.   All  of  these  aenus  contain  information  similar 
to  that  shown  in  figure  5  for  confidence  intervals. 

The  parameters  about  which  an  interval  is  to  be  computed 
or  a  hypothesis  is  to  be  made  are  listed  in  the  right  hand 
column  of  the  menu.   The  distribution  from  whi^h  the 
observations  came  and  assumptions  ibout  the  populations  are 
listed  in  columns  one  and  two,  respectively. 
1.   Data  Requirements 

When  one  of  the  options  is  selected  that  does  not 
involve  Bernoulli  or  Poisson  populations  the  following 
display  appears: 

1)  Use  existing  data  file 

2)  Enter  data  and  store  it 

3)  Enter  data  w/o  storing 
H)  Use  summary  statistics 

Because  of  the  nature  of  the  observations,  any  tests 
or  intervals  involving  Bernoulli  or  Poisson  observations  are 
entered  using  summary  statistics;   hence,  for  these  cases, 
this  display  is  skipped. 

a.  Use  existing  Data  File 

The  user  is  prompted  for  the  name  of    the  data 
file.   When  the  file  name  is  entered,  the  program  retrieves 
the  summary  statistics  file  associated  with  that  file  from 
disk.   In  the  case  of  bivariate  populations,  two  file  names 
are  needed,  the  first  containing  the  'X*  observation  and  the 
second  containing  the  '!•  observation.  Paired  observations, 
as  noted  previously,  are  entered  in  one  file. 

b.  Enter  Data  and  Store  it 

Selecting  this  option  will  display  a  short 
message  to  the  user  informing  him  that  all  data  storage  must 
be  accomplished  from  the  data  entry  module  (Figure  3) .   The 
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user  has  the  option  at  this  point  to  enter  the  data  withDut 
storing  it  or  to  return  to  the  outer  level  (figure  2)  aai 
select  the  data  entry  module. 

c.  Enter  Data  a/0  Storing  It 

Data  is  entered  in  the  same  fDrmat  as 
prejriously  discussed  in  the  section  pertaining  to  the  data 
eatry  module.  No  permanent  disk  record  is  made  o£  the 
entries.   Hence,  once  each  record  of  observations  is 
terminated,  there  is  no  way  to  retrieve  it  to  make 
corrections.   Where  bivariate  populations  are  required,  the 
user  is  prompted  to  enter  all  of  the  • X»  observations  first 
and  all  of  the  • !•  observations  second.   For  paired 
observations  all  of  the  •X,!'  pairs  are  eitered  as  one 
population. 

d.  Use  Summary  Statistics 

On  all  other  tests  or  intervals,  use  of  suimary 
statistics  is  optional  except  as  previously  mentioned  for 
data  from  Bernoulli  or  Poisson  populations.   At  each 
prompt,  the  user  is  asked  only  for  the  necessary 
information  to  perform  the  statistical  procedure  he  has 
selected.   The  distinction  is  made  in  each  prompt  whether 
or  not  the  statistics  required  are  the  sample  parameter 
values  (sample  mean,  sample  standard  deviation)  or  the  true 
parameter  values  (true  mean,  true  standard  deviation) . 
2.   Confidence  Intervals 

When  computing  confidence  intervals,  the  user  must 
supply  the  additional  information  concerning  the  desired 
level  of  confidence  and  the  type  of  interval  (two-sided, 
one-sided  upper,  one-sided  lower).   Typically,  computation 
of  the  desired  interval  takes  one  or  two  seconds  and  is 
displayed  as  follows: 
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95.00  Percent  confidence  intervals  for  a 

Sample  aean  =   3.000 
Standard  deviation  =  1.581 
Upper  =   4.964 
Lower  =   1.036 

Another  interval  using  same  data,  Y)es  N)  o  — > 

Following  the  display,  the  user  is  asked  whether  or 
not  he  desires  to  compute  another  interval  using  the  same 
data.   If  he  responds  'yes',  he  lay  then  vary  the 
confidence  leval  and/or  the  type  of  interval  without 
having  to  again  specify  the  data  base.   Answering  'no*  will 
return  the  user  to  the  menu  for  the  module  in  which 
computations  are  currently  being  performed. 
3.   Hypothesis  Testing 

Hypothesis  tests  require  the  user  to  specify  the 
null  hypothesis.   Typically,  the  hypotheses  involve  'equal 
to»,  'less  than  or  equal  to*,  or  'greater  than  or  equal  to* 
comparisons  and  are  displayed  for  the  user  in  a  form 
similar  to  the  one  below: 

1)  u  =  u  [Ol 
2  u  <=  u  [0] 
3)  u  >=  u  [03 

The  symbol    loj    represents   the   null   hypothesis 
value.    The  user   enters   this   value   if   required   by   the  test. 
An   example  display   following    computation    is   as    follows: 

HYPOTHESIS:       U    =    u  [01 
Sample   mean   =  3. ODD 

u  [Of    =      2.500 
The   P-value   is    0.519 

Another  test   using   the   same   data.    If)  es    M)  o  — > 

The   user   is   not    told    to   accept    or   rejact    the 
hypothesis;      rather,    ha    is  given   a    p-value   as    shown   above. 
The   p-value,    or    probability   level,    is  an   indication   of   the 
level    of   confidence   associated   with    the   hypothesis    JRef. 
12].      High   p-values   convey   a    high    confidence   in    the   null 
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hypothesis;      conversely,    low    p-valaes   reflect    a    lack  of 
confidence   in   the  validity   of   the   hypothesis.      A   p-value   of 
.05,    for    example,    indicates   that   if   the   hypothesis   is   indeed 
true,    there  is   only   one    chance   in   twenty   that   the   data   used 
in   the  test   is   consistent   with  the   hypothesis,      Upon 
completion  of   the   test,    the   user   has   the   option    to    perform 
another  test   with   the   same  data   or   return   to   the   menu   for 
the  module  in   which   tests   are   currently   being   performed. 
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V.   C0NCLJSI3NS 

The  decada  of  the  i930»s  promises  to  be  particularly 
bright  in  tens  of  affording  the  operations  research  analyst 
easy  access  to  computing  power.   Microcomputers  now 
available  and  their  more  capable  descendants  will  doubtless 
play  an  important  role  in  securing  this  access.   However, 
equally  as  important  are  the  software  packages  which  will 
accompany  these  computers. 

The  software  package  described  in  this  papar  provides 
the  analyst  with  a  useful  set  of  statistical  tools  which  can 
be  used  on  one  of  the  most  popular,  current  microcomputers, 
the  Apple.   The  five  major  modulas  contained  in  the  package 
(confidence  intervals  for  single  and  bivariate  populations, 
hypothesis  testing  for  one  and  two  parameters,  and  the 
distributions  and  quaatiles)  are  designed  to  bs  easy  for  the 
analyst  to  use  and  to  cushion,  as  much  as  possible, 
potential  user  mistakes.   The  algorithms  used  throughout  the 
program  were  chosen  on  the  basis  of  being  compatible  with 
the  microcomputer  with  respect  to  size  and  computing 
precision  and  providiag  the  best  combination  of  speed  and 
accuracy. 

Pascal,  the  programming  language  used,  offars  not  only 
the  advantages  of  portability  ani  increased  computational 
speed,  but  also  flexibility.   Pascal  is  flexible  in  that 
large  complex  programs  are  programmed  in  modular  segments 
which  are  then  combined  into  the  overall  program.   It 
follows  that  programs  developed  in  this  way  are  easily 
enhanced  by  the  addition  of  new  modules.   Such  is  the  case 
with  this  statistical  package  which  could  be  significantly 
enhanced  by  the  addition  of  a  rsgrassion  and  aa  analysis 
of  variance  module. 
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This  package  and  those  to  follow  which  ace  compatible 
with  current  and  future  microcomputers  can  have  a 
significant  impact  in  the  analyst  community  in  two  key 
areas.   First,  the  analyst  can  be  better  educated.   Simply 
alleviating  the  tedium  which  accompanies  the  application  of 
many  statistical  procedures  will  give  fledgling  analysts  the 
opportunity  to  work  more  problems  and  be  exposed  to  a 
greater  variety  of  situations  in  the  school  environment. 
Perhaps  of  equal  importance,  the  educational  environment  can 
provide  the  opportunity  to  accustom  the  analyst  to  the 
capabilities  that  can  and  should  be  available  for  his  use  in 
a  working  environment.  Second,  by  expanding  computing  power 
into  areas  which  were  not  privileged  to  have  it  before,  the 
educational  process  has  a  better  opportunity  to  continue. 
Today,  it  is  reasonable  to  assume  that  the  professional 
growth  of  many  analysts  is  stifled  from  a  lack  of  computing 
machinery  with  which  to  attack  his  problems. 

Taking  full  advantage  of  the  microcomputer's  hardware 
capabilities  requires  efficient  compatible  software.   In 
specialized  areas  such  as  operations  research,  the  analysts 
themselves  must  logically  provide  the  bulk  of  the  effort  in 
software  development.   The  statistical  package  which  is  the 
subject  of  this  thesis  affort  scarcely  begins  to  provide  the 
full  complement  of  tools  which  the  analyst  requires.  If  the 
operations  research  community  is  to  take  advantage  in  a 
timely  manner  of  the  new  opportunities  afforded  by 
microcomputers,  effort  must  continue  now  in  software 
development. 
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APPENDIX    k 


FUNCTION    Z 


*  * 

*  NORMAL    DISraiBUTION  * 

*  SOME    COMMON    BASIC    PROGRAMS  * 

*  3RD    ED.,    P.     128  * 

*  * 

4t 


: 


:4(](c4(>i(«:^4c«*«»c«#:4c4c**«'!(«««4(4c4c:4e)0c4(4c'te>tc««4(4c4e4c4c4( 


CONST 

CI    =   0.4361836: 
C2    =  -0.  120  16  76; 
C3    =   0.937298; 
C4    =   0.33267: 
C5    =   2.50662S3; 


VAR 

XX,XT,R,T 


SEAL; 

THEN 


BEGIN 

IF    STDEV   <=    0.0 
BEGIN 

ERROR; 

EXIT  (Z)  ; 
END 
ELSE 
BEGIN 

XT  •  =  X* 

X  :=  ABS((X-MEAN) /STDEV)  ; 

XX    :=   X*X; 

R    :=    EXP(-XX/2.0» /C5; 

X  :=  1.0/(1.0  +  Ci**ABS(X)); 

T    :=    0.5   -a*(C1*X   *C2*X*X  +C3*X*X*X) ; 

IF  XT  <  MEAN  THEN 
Z    :=  0.5    -    T 

ELSE 

Z    :=  T   ••-    0. 5; 
END; 
END;  (*  Z  *) 
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FUNCTION    INVZ    ; 

*  * 

*  NORMAL    QdANTILES  * 
{*       HANDBOOK    OF    M ATHEHATI CAL    FUNCTIONS       *i 

*  P.    933  * 

*  « 

:4c  « 

CONST 

CI  =  2.515517 
C2  =  0.802853 
C3  =  0.010323 
D1  =  1.432788 
D2  =  0.  189269 
D3    =    0. 001308 

VAR 

PT^T,NUM,DEN 


;    REAL; 
OR     (P    <'    0.0)     THEN 


BEGIN 

IF     (P    >=    1.0) 
BEGIN 

ERROR; 

EXIT     (IN7Z)  ; 
END 
ELSE    IF    P    >    0.5    THEN 

PT    :=    1.0    -    P 
ELSE 

PT    ' —    P* 
T    :=   SQRT*(LN(1'0/(PT*PT)))  ; 
NUM    :=    CI    >    C2*T    ■►   C3*T*T: 
DEN    :=     1.0    *    D1*T    *    D2*T*T    ■»•    D3*T*T*T 
IF    P    >    0.5    THEN 

INVZ    :=    T    -    NUM/DEN 
ELS  E 

INVZ    :=-(!-    NUM/DEN)  ; 
END;     (*    INVZ    *) 
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FUNCTION    T    ; 

*  *  1 
Hi  :^ 

*  T    DISTRIBanON  * 

*  CACM  * 

*  ALGORITHM    344  * 
>*  * 

*  * 

CONST    CI    =   0.63661977; 

VAR 

I,N    :    INTEGER: 
ANS,D2,P1,F2,T1 ,T2,Xr     :    REAL; 

BEGIN 

IF    DF   <    1    THEN 
BEGIN 

ERROR; 

EXIT  (T)  ; 
END 
ELSE 
BEGIN 

XT   :=   X; 


X    :=    ABS(X); 

T1    :=    X/SQRT(DF): 

T2    ;=    1.0/(1.0+T1*ri) ; 


IF   ODD    (DF)     THEN 
BEGIN 

ANS    :=    1.0-C1*ATAN (T1) ; 

IP    DF    <>     1    THEN 

BEGIN 

D2    :=    31*T1*T2: 
ANS    :=    ANS    -   D2; 
END: 

IF    DF    <>    3    THEN 
BEGIN 

F1    :=   0.0: 

N    :=    (OF-2)    DI7    2: 

FOR    I    :=    1    TO    N    DO 

BEGIN 

F2    :=    2.0*I-F1; 
D2    :=    D2*T2*F2/(F2  +  1)  ; 
ANS    :=    ANS-02; 
END; 
END; 
END 
ELSE 
BEGIN 

D2    :-   T1*SQRT  (T2)  ; 
ANS    :=    1.0-D2; 
IF    D2    <>    2    THEN 
BEGIN 

Fl    :=    1 .0: 

N    :=    (DF-2)     DI7    2: 

FOR   I    :=    1    TO    N    DO 

BEGIN 

F2   :=    2.0*I-F1; 
D2    :=    D2*T2*F2/(F2-H)  ; 
ANS    :=    ANS-D2; 
END;     (*    END    FOR    *) 
END; 
END:     (*    END    ELSE   ♦) 
IF    ANS    <    0.  0    THEN 
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ANS  :=  0. 0; 
IF  XT  <  0.0  THEN 

T  :=  ANS/2.0 
ELSE 

T  :=  1.0  -  ANS/2.0; 
END; 
END;  (*  T  *) 
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FUNCTION    INVT 


:4c«4i  41  «:(i:4ci«i  4c4c« 4( «4c 4i4( ♦♦  «4c *>«( «  :4(  *«**  «4c :4c4i  4c4c  *4c  * 4(  # 

*  * 

*  T    3aANriLE3  * 

*  CACM  * 

*  ALGORITHM   396  * 

*  ♦ 

CONST 

HALFPI    =    1.570796327; 

7AR 

PT,DEN,A,BrC,D,X, Y    :    REAL; 

BEGIN 

PT    ' —    P  * 

IF     (DF    <:    1)    OR    (P    >=    1.0)     OR    (P    <-    0.0)     THEN 

BEGIN 

ERROR; 

EXIT    (INVT)  ; 
END 
ELSE    IF    P    >    0.5    THEN 

P    :=    2.0*  (1  .0    -   P) 
ELSE 

P    :=    2.0*P; 
IF    DF   =    2   THEN 

INVT    :=    SQaT(2.0/(P*  (2.0    -    P))-2.0) 
ELSE 
BEGIN 

IF   DF    =    1    THEN 

BEGIN 

P    :=s  P*HALFPI; 

INVT    :=    COS  (P)/3rN  (P)  ; 

END 

ELSE 

BEGIN 


A 

B 
C 
D 
X 

Y 

IF 


-3.5)  ; 


=    1.0/ (DF    -   3.5 
=   t»8.0/iA*A)  : 
=    ((20700*A/B    - 


=    ((20700*A/B    -    98.0)*A    -    16.0)*A   +    95.36; 

=    ((9'».5/(B   ♦•    C)     -    3.0)/3    +    1.0)  *SQRT(A*HALF?I)  *DF; 

=   D*P; 


=    XPN(X.  (2.0/DP).)  ; 

I    >    0.05    *    A    THEN 
BEGIN 


X   :=   INVZ  (P*0.5)  ; 

Y  *  -    X*X* 
IF'dF    <    §    THEH 

C    :=   C    +    0.3*(DF  -    U.5)*(X   ♦    0.6); 
C    :=    (((0.05*D*X    -   5.0)*X    -    7.0)*X   -    2.0).*X    "►    B   ♦    C; 

Y  :=     ((i((0.4*f    ♦    6,3)*Y    +    36.0)*Y   *    9(4 . 5) /C-Y-3. 0)  / 
B  +  1.0)  *X; 

Y  :=    A*  (Y*Y) ; 

IF   Y    >0.002    THEN 

=    EXP(Y)     -    1.0 


Y 

ELSE 
Y 
END 
ELSE 

Y   :  = 


=    0.5*  (Y*Y)     ♦   Y; 


IF    PT    >= 

INVT    :=    SQaT(DF*y) 


( 1.  0/ f((DF> 6.  0)/(DF*y) -0.08  9*0-0.  8 22)  * 
DF>2.0)  *3.0)  +0.5/(DF«-4.0)  )  *Y-1.0)  * 
DF+1.  0)  /(DF  +  2.0)  ♦1.0/Y; 

.5    THEN 
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ELSE 

INVT    :=    -    SQRT(D?*Y) 
END; 
END; 
END;     (*    INVT    *) 
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PaNCTION  CHI3Q  ; 

*  CHI-SQUAfiS    DISTRIBaTION  *1 

*  SOMB    COMMON    BASIC    PROGRAMS  * 
I*                               (DF    <=    40)     P.    130  * 

*  HANDBOOK    OF    MArHEMAnCAL    FOMCTIONS       * 

*  (DF    >    UO)     P.    9U1  * 

*  * 

lie « 4i  «  :ic « « 3|citi 4e « :(i  4i 3*1  iK ♦  :«( 4e  « :«( 4c  4c  *  «  4i :4c it  i«e  :((:ec « «  « :4c  « 4c «  «  4c 3«c 

7AR 

I  ;  INTEGER: 

Y,POWER,TSaP,NUa,DEN, J,L,M  :  REAL; 

BEGIN 

IF  (DF  <  1)  OR  (X  <=  0.0)  THEN 
BEGIN 
ERROR; 

EXIT  (CHIS3); 
END 

ELSE  IF  DF  >  40  THEM 
BEGIN 

Y  :=  ((XPN(X/DF, 1.0/3.0) -1.0)  <-2. 0/(9. 0*DF)  I /SQRT(2.0)  ; 
I  :=  Y*3.0*SQRT(DF)  ; 
IF  Y  <  -4.3  THEN 

CHISQ    :=    1.0 
ELSE  IF  Y  >  4.3  THEN 

CHISQ  :=  0.0 
ELSE  CHISQ  :=  Z  (Y,0.0,1.0) ; 
END 
ELSE 
BEGIN 

DEN  :=  1.0; 
TEMP  :=  DF; 
REPEAT 

DEN  :=  DEN  *  TEMP: 
TEHP    :=    TEMP   -   2.0; 
UNTIL  TEMP  <  2.0; 
POWER  :=  (DF  >  1)  DIV  2; 
NUM  :=  XPN(X, POWER)  *EXP(-X/2.0) /DEN; 
IF  ODD  (DF)  THEN 

J  :=  SQRT  (2. 0/X/3. 1415926) 
ELSE 

J   :=    1.0; 
L    :=    1.0; 
M    :=    1.0; 
REPEAT 

DF    :=   DF   +   2; 
M    :=   M*X/DF; 
L    :=   L+H: 
UNTIL    M    <   0.0000001; 
L    :=    L-M; 
CHISQ    :=    J*NUM*L; 
END; 
END;     (*   CHISQ    *) 
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FUNCTION  INVCHI    ; 

*  CHI-SQUARE    aU&NTILES  * 

*  CACM  *' 

*  ALGORITHM    451  * 

*  * 

*  * 


C2 

= 

1.0604 

CS 

s 

2,2775 

38 

= 

2.5300 

311 

= 

-1.153 

cm 

= 

-2.237 

CI  7 

= 

3.  1255 

320 

= 

U.7139 

A2 

= 

-1.425 

\5 

= 

-1.091 

A8 

s 

-2.728 

All 

= 

2.6189 

A14 

= 

3.4837 

A17 

— 

-8.729 

38E-3; 

79E-2; 

1E-3; 

751E-2; 

368E-1 ; 

80E-3; 

41E-1: 

296E-2; 

121E-2; 

484E-4; 

lE-2: 

89E-5: 

713E-3; 


CONST 
CI      =    1.565326E-3: 
C4      =   -1.323293E-2; 
C7      =   -1.51390E-2; 
C10    =   5.  169654E-3; 
C13    =    2.607083E-2: 
C16    =   -8.426812E-4; 
C19    =    1.348028E-4; 
A1      =    1.264461E-2: 
A4      =   -5.886090E-3; 
A7      =    3.  135411E-3; 
A10    =    1.316872E-2; 
A13    =    5.40667E-5: 
A16    =   3.292181E-3; 
A19    =    1.0; 

7AR 

F,F1,F2,TEHP    :    REAL; 

BEGIN 

IF     (DF    <    1)     OR     (P    <=    0.0)     OR    (P    >==    1.0)     THEN 
BEGIN 
ERROR; 
EXIT     (INVCH 
END 
ELSE 
BEGIN 

P    :=    1.0    - 
IF   DF   =    1    r 
BEGIN 

TEMP  :=  I 
INVCHI 
END 
ELSE  IF  DF 

INVCHI  := 
ELSE 
BEGIN 

F    :=    DF; 
F1    :=    1.0 
TEMP   :=    I 

F2  :=  saa 

IF    DF    <     ( 

TEMP 


C3 

= 

-6.959 

C6 

= 

-8.986 

C9 

= 

-1.450 

C12 

= 

1. 1261 

C15 

= 

9.7804 

C18 

= 

-8.553 

C21 

= 

1.0000 

A3 

= 

1.40C4 

A6 

= 

-2.304 

A9 

= 

-9.699 

A12 

s 

-2.2  22 

A15 

= 

-7.274 

A18 

s 

4.7140 

356E-3; 

007E-3; 

117E-3; 

86  E- 2; 

99E-5i 

069E-3; 

886; 

83E-2: 

527E-^; 

681E-3; 

222E-1; 

761E-4; 

45E-1; 


I)  ; 


P; 

HEN 


NVZ(0.5*P)  ; 

=    TEMP*rEMP; 

=    2    THEN 
-2.0*LN (P) 


NVZn.O-P)  ; 
't)*rEMP; 


T(Ft)*rE 
2    *  ,TRUNC  (4.0*ABS^TEMP)  )  ) 

;r<JF2*C6)  *F2>C7)  *F1-»- 


(  ((  ((((C1*F2  +  C2) *F2+C3) 

<-C5»  *F2*C6)  *F2>C7)  *F1-»-  (  f  (  (  ( i38+C9  *F2) 
♦  CIO)  *F2+C11) *F2+C12)  «F2+CTJ)  *F2*C14) 


THEN 
2+C4) *F2 

38+C9*F2l 


ELSE 

TEMP    := 


INVCHI    :  = 
END; 


(in(C15*F24-C16l  *F24-C17)  *F2<-318)  *F2 
+  C19)  *F2+C20)  *F2*C21 

(  ((AUBL2*F2)^*Flt(  ( jA3>A4*F2)  *F2 


*F2 
)  *F1  + 


^:A5|i  *F2  +  A5)  )*?1^l  iii  (A7>A8*F2)  *F2^-A9)  *F2 
-  -~    -12))*F1>(((((A13*F2 


■••A10)  *F2>A'1  1)  ♦ 
♦  Al4i  *F2+A15)  * 
■••A18I  *F2  +  A19: 

TEMP*rEMP*TEMP*F; 


)")  *F1>(  (  (  (  (A13*F: 
F2«-A16)  *F2*A17)  *F2*F2 


KO 


END; 
END;        (* 


INVCHI    *) 


'4  1 


FONCTIO»    F1; 

*  * 

*  F    DISTRIBCJriON  * 

*  SHALL    DEGREES    0?    FREEDOM  * 

*  STATISTICAL    COMPUTING  * 

*  P.     1U  * 

*  * 

*  * 


CONST    PI    =    3.14  159265; 


VAR    ArA1,AiI1,AN2,AaT,D1,D2,D3,R,S1,S2,DELrXM,XK,r, 

TEM,C    :    REAL; 
I,M,N    :    INTEGER; 

BEGIN 

AMI     :=    DF1; 

AN2    :=    DF2; 

A    :=    DF1*X/(DF1*X-»-DF2)  ; 

A1    :=    1    -   A* 

IF    A1    <=    0.5    THEN 

A1    :=    1.0E-37; 
D1    :=   AN1*0.5; 
D2    :=    AN2*0.5; 
D3    :=   D1    4-    D2    -    1  .0; 
R    :=   0.0: 

51  :=   0.0; 

52  :=  0.0: 
DEL  :=  1.0; 
xa  '.-  1.0; 
XK  :=  1.0; 
C  :=  0.25; 
N    ' —    DF 2 * 

WHILE      C'<    0. 875    DO 
BEGIN 

M    :=    TRaNC(D2) ; 
M    :=    2   *    M; 
IF    M    =    N    THEN 
BEGIN 

N    :=   TR0NC(D2)-1; 
IF    N    >    0    THEN 
BEGIN 

FOR    I    :=    1    TO    N    DO 
BEGIN 

51  :=  DEL<-S1*R; 
D2  :=  D2  -  1.0; 
D3  :=  D3  -  1.0; 
TEM    :=    A1/D2; 

R  :=  D3*TEM; 

52  :=  (R+TEM)  *S2; 
END; 

END; 

51  :=  DEL+S1*R; 
DEL  :=  0. 0; 

T  :=  -1.0: 
D3  :=  -1.0; 

52  :=  &*S2; 

C  :=  C  *  0.5; 
END 

ELSE   (*  DEGREES  OF  FREEDOM  ODD    *) 
BEGIN 

N  :=  TRUNC(D2)  ; 

IF  N  <>  0  THEN 
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BEGIN 

FOR  I  :=  1  TO  N  DO 
BEGIN 

51  :=  DEL+S1*a; 
D2  :=  D2  -  1.0; 
D3  :=  D3  -  1.0; 
TEM  :=  A1/D2; 

R    :=    D3*TEM; 

52  :=     (R+TEM)  *S2; 
END;     (*    FOR    *| 

END; 

51  :=    XK*S1 ; 

52  :=   XK*S2; 
ART    :=    saRTjfAl)  ; 
Xa    :=    X[1*ART; 

T    :=    (XH-AET) /A1 ; 

D3    :=   -0.5: 

XK    :=   2.0/PI; 

C   :=   C*2. 0; 
END; 

IF   C    <=    0.875    THEN 
BEGIN 

D2    :=    D1; 

D3    :=   D2    +    D3; 

S2:=   SI; 

S1    :=   0,0; 

^1    •=   ^ • 

IF    A1    <4    0.0    THEN 

A1    :=    1.0E-37; 
M   :=  DF1; 
END; 
END;        (*    WHILE    *) 
IF    C   <    1.125    THEN 

DEL    :=    4.0/PI*ATAN(r) : 
F1    :=   xa*(S2    -    SI)    -    DEL; 
END;     (*   F1    *) 
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FaNCTIOU    F2; 


*  *  I 

*  * 

*  F    DISTRIBOTION  *> 

*  LARGE    DEGREES    OF    FREEDOM  * 

*  EDOCATIONAL    AND    PSYCHOLOGICAL    MEASaREidENr  * 

*  7.     25,    NO. 3,    P.     877-879  *i 

*  * 

CONST 

CI  =  0.196854;C2  =  0.115194;C3  =  0. 000344;  C'4  =  0.019527; 


VAR 


REAL; 


BEGIN 
IF  X  > 
BEGIN 

S  :- 

T  '.- 

Z    :  = 
END 
ELSE 
BEGIN 

S    :  = 

T    :  = 

Z    *  — 
END;* 

J   :=  2.0/9.0/S; 
K    :-  2.0/9.0/T: 


1    THEN 

DF1; 
DF2; 
X; 


DF2; 
DF1; 
1.0/X; 


Y    z-   ABSj[(1.0-ft)*XPNfZ,  (1.0/3.0)  I -1.0>J)/ 
SQRT(K*XPN(Z-  (2.0/3.0)  •     "" 

lY    :=    1.0    *    Y*  (C1+Y*  " 


Y*  (C1+Y 
XX    :=   0.5/XPN  (YY,4.3) 
'THEN 


(C 


)>J) ; 

2-^Y*  ( 


(C3  +  Y*C4)  )  ) 


IF  X  >=  1.0 
F2  :=  1.0 

ELSE 

F2  I-    XX; 
END;  (*  F2  *) 


-  XX 


FUNCTION 
BEGIN 

IN 


F; 


IF  (DF1  <  1) 


beg: 

ERROR; 

EXIT  (F)  ; 
END; 

IF  ((DF1  <  100)  AND  (DF2 
(DFl  <  20)  OR  (DF2  <  20) 

F  :=  F1  (X,DF1,DF2) 
ELSE 

F  :=  F2  (X,DF1,DF2) ; 
END;  (*  F  *) 


OR  (DF2  <  1)  OR  (X  <  0.0)  THEN 


<  100)  )  OR 

THEN 
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FUNCTION    IN7F    ; 


*  * 
4c  in 

*  P    QOANTILES  * 

*  BISECTION    SEARCH    FOR  * 

*  SMALL    DEGREES    OF    FREEDOa  * 

*  LARGE    DEGRESS    OF    FREEDOM  * 

*  HANDBOOK    OF    MATHEMAnCAL    FUNCTIONS  * 

*  P.     91*7  * 

*  * 

«:4c«*4(«:*4c4c*4c4c4c4(4t4(4c4(«4(4(4c«4(4c4c4(4c*4c«4(4c«4c4i««4(« 


CONST    EPS    =    0.005; 
7AR 


TEMP,PT,ENDR,ENDL,MIDPT,STEP, W,H,T,r,Z 
1)     OR     (DF2    <    1)     OR    (P    <=:    0.0) 


OR 


REAL; 

(P    >=    1.0)     THEN 


OR     (DF2    =    1)     THEN 


BEGIN 

IF     (DFl    < 
BEGIN 
ERROR; 

EXIT     (INVF)  ; 
END 

ELSE  IF  {DFl  =  1) 
BEGIN 

IF  DF1  =  1  THEN 

INVF  :=  SQR  (INVT(  (0.5*(1.0  +  P)  )  ,DP2)  ) 
ELSE 

INVF  :=  1  .0/SQR(INVT((1 .G-P/2.0) ,DF1)) ; 
END 
ELSE 
BEGIN 

Z    :=    INVZ(P) : 

Y    :=    (Z*Z   -    3.0)/5.0; 

H    :=    2.0    *    1.0/(1.D/(DP2-1.0)     <•    1 . 0/(DFl-1 .  0)  )  ; 

i    :~     (Z*SQRTrH+I)  ) /H    -    (1  .0/(DF1-1.0) -1.0/(DF2-1. 

]y   *    5.0/6.0    -2.0/(3-0*H)) ; 
T    :==    EXP(2.0*W)  : 

IF    (DF1    <=    15)     OR    (DF2    <=    15)     THEN 
BEGIN 

STEP    :=   0.1*T; 
ENDR    :=    T; 
REPEAT 

TEMP  :=  ENDR; 

ENDL  :=  ENDR  -  STEP; 

IF   ENDL    <=    0.3    THEN 

ENDL    :=    0.0; 
ENDR    :=    ENDL; 
PT    :=    F1     (ENDL-DF1,DF2)  ; 
UNTIL    (PT    <=    P)     DR     ((TEMP    - 
ENDR    :=    TEMP; 

IF    ABS(EHDR    -    ENDL)     >    EPS    THEN 
BEGIN 

WHILE     (ENDR    -    ENDL)     >    0.1    DO 
BEGIN 

T    :=     (ENDR    «•    ENDL)^/2.0; 
PT    :=    F1     (T,DP1,DF2) ; 
IF    Pr    <    P    THEN 

ENDL    :=    T 
ELSE 

ENDR    :=    T; 
END; 
END; 


0)) 


ENDL)     <=    EPS) 
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INVF  :=  (ENDL  ♦  ENDR)/2.0; 
END 
ELSE 

INVF  :=  r: 
END;  (*  ELSE  *) 
END;  (*  INVF  *) 
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APPENDIX    B 


PROCSDUaE    C0»7ERTSTRIN3; 


'if}liiti4t********^^***^Tiit**'if******y^***  ******* 
[*  * 

*  PROCEDURE    TO    CONVERT    A    STRIN3  * 

*  VARIABLE    TO    A    NiJMBER  * 

*************************  *:i)nlf***:^*i«f***7^** 


LABEL    1; 

7AR 

?LACEINT,SIGN,I    :    INTEGER; 
BFDEC    :     BOOLEAN; 
ACC,TACC, PLACE    :    REAL; 
PREVCH,CH    :    CHAR; 

BEGIN 

DATA    :=    CONCAT     (DATH,*     »); 

PREVCH    :==••; 

BFDEC    :=   TRUE; 

INT    :=    0: 

RL    :=   0.0: 

ACC    :=    0.0; 

PLACE   :=    1; 

SIGN    :=    1; 

FOR    I    :=    1    TO    LENGTH     (DATA)     DO 

BEGIN 

CH  ;=  DATA   I  ; 

IF  CH  =  PREVCH  THEN 

GOTO  1; 
PREVCH  :=  •/' ; 
IF  CH  IN  C»  0*  ..  'g*  ,'  .  «,* 
CASE  CH  OF 

•0»,M»,«2»r*3«,»4»,»5»,'6*,'7»r»8»,«9«: 
BEGIN 

IF  BFDEC  THEN 
BEGIN 

RL  :=  RL  *  PLACE  *    lORD     (CH)-48)  ; 
PLACE  :=  10; 
END 
ELSE 
BEGIN 

PLACE  : =  PLACE  *  0. 1: 
RL  :=  RL  >  PLACE  *  (ORD  (CH) -48) 
END; 


t  • .  f  f  1 1 


THEN 


END; 

BEGIN 
BFDEC 
PLACE 

END; 


FALSE; 


«-  • 


BEGIN 
RL 
IF 

I 
ELSE 
INT 
END; 


=  RL  *  SIGN 


A3S  (RL)  <  3^767.0  THEN   (*  MAX  INTEGER  *) 
NT  :=  TRaNC  (RL)  (*  IS  32767     *( 

:=  1; 


i»7 


BEGIN 

SIGN  :=  -1: 
PRE7CH  :=  *  • 
END; 
•,':GOTO  1; 
END;  (*CASE*) 
1:  END; 
END;  (*  CONVERISTRING  *) 
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